Performance of 3D Deconvolution Algorithms on Multi-Core and Many-Core Architectures

نویسندگان

  • Cory W. Quammen
  • David Feng
  • Russell M. Taylor
چکیده

Deconvolution algorithms are commonly used to remove optical distortion from fluorescence microscopy images. Many such algorithms have been proposed, but those that produce the best image restoration results are iterative. Typically, each iteration involves one or more 3D convolutions, resulting in execution times of tens of seconds to several minutes for common image sizes on single-core computers. Fortunately, most of the constituent computational primitives in deconvolution algorithms are readily parallelized on shared memory architectures. In this paper, we analyze the performance of three deconvolution algorithms implemented on modern multi-core central processing units and on many-core graphics processing units. We discuss the computational primitives in the deconvolution algorithms and their implementations, and compare performance of the two implementations on recent parallel processing architectures.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Wind Turbine Transformer Optimum Design Assuming a 3D Wound Core

A wind turbine transformer (WTT) is designed using a 3D wound core while the transformer’s total owning cost (TOC) and its inrush current performance realized as the two objective functions in a multi-objective optimization process. Multi-objective genetic algorithm is utilized to derive Pareto optimal solutions. The effects of inrush current improvement on other operating and design parameters...

متن کامل

Efficient parallelization of the genetic algorithm solution of traveling salesman problem on multi-core and many-core systems

Efficient parallelization of genetic algorithms (GAs) on state-of-the-art multi-threading or many-threading platforms is a challenge due to the difficulty of schedulation of hardware resources regarding the concurrency of threads. In this paper, for resolving the problem, a novel method is proposed, which parallelizes the GA by designing three concurrent kernels, each of which running some depe...

متن کامل

Performance analysis of a 3D unstructured mesh hydrodynamics code on multi- and many-core architectures

Several next generation high performance computing platforms are or will be based on the so-called many-core architectures, which represent a significant departure from commodity multi-core architectures. A key issue in transitioning large-scale simulation codes from multi-core to many-core systems is closing the serial performance gap, that is, overcoming the large difference in single-core pe...

متن کامل

Ultra-Low-Energy DSP Processor Design for Many-Core Parallel Applications

Background and Objectives: Digital signal processors are widely used in energy constrained applications in which battery lifetime is a critical concern. Accordingly, designing ultra-low-energy processors is a major concern. In this work and in the first step, we propose a sub-threshold DSP processor. Methods: As our baseline architecture, we use a modified version of an existing ultra-low-power...

متن کامل

Performance comparison of designated preprocessing white light interferometry algorithms on emerging multi- and many-core architectures

Parallel computing has been a niche for scientific research in academia for decades. However, as common industrial applications become more and more performance demanding and raising the clock frequency of conventional single-core systems is hardly an option due to reaching technological limitations, efficient use of multi-core CPUs has become imperative. 3D surface analysis of objects using th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009